Learning the structure of a Bayesian network: An application of Information Geometry and the Minimum Description Length Principle

نویسنده

  • Eitel Lauria
چکیده

The field of Bayesian Networks has had an enormous development over the last few years and is one of the current key topics of research in the design of statistical machine learning and data mining algorithms. Bayesian networks are a natural marriage between two areas in mathematics: graph theory and probability theory. A Bayesian net encodes the probability distribution of a set of attributes by specifying a set of conditional independence assumptions together with a set of relationships among these attributes and their related joint probabilities. When used in this way, Bayesian networks result in a powerful knowledge representation formalism based on probability and provide a natural way of dealing with uncertainty and complexity, two recurring topics that have impact across a wide range of knowledge domains. The present paper addresses the issue of learning the underlying model of the Bayesian network, expressed as a digraph, which includes the specification of the conditional independence assumptions among the attributes of the model; and given the model, the conditional probability distributions that quantify those dependencies. We heuristically search the space of network structures using a scoring function based on the updated version of the Minimum Description Length Principle, that takes into account the volume of the model manifold [1] [2]. We present empirical results on synthetic datasets that analyse the relative effectiveness of this approach when varying the size and complexity of a Bayesian network. References: [1] J. Rissanen (1996), “Fisher Information and Stochastic Complexity”, IEEE Transaction on Information Theory, 42, 40-47 [2] C. Rodriguez (2001), “Entropic priors for discrete probabilistic networks and for mixtures of Gaussian models”, presented at MaxEnt2001, APL Johns Hopkins University, August 4-9 2001

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf

Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation  method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...

متن کامل

A Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf

Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation  method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...

متن کامل

Statistical and Information-Theoretic Methods for Data Analysis

In this Thesis, we develop theory and methods for computational data analysis. The problems in data analysis are approached from three perspectives: statistical learning theory, the Bayesian framework, and the informationtheoretic minimum description length (MDL) principle. Contributions in statistical learning theory address the possibility of generalization to unseen cases, and regression ana...

متن کامل

An Introduction to Inference and Learning in Bayesian Networks

Bayesian networks (BNs) are modern tools for modeling phenomena in dynamic and static systems and are used in different subjects such as disease diagnosis, weather forecasting, decision making and clustering. A BN is a graphical-probabilistic model which represents causal relations among random variables and consists of a directed acyclic graph and a set of conditional probabilities. Structure...

متن کامل

Learning Bayesian Belief Networks Based on the MDL Principle: An Efficient Algorithm Using the Branch and Bound Technique∗

In this paper, the computational issue in the problem of learning Bayesian belief networks (BBNs) based on the minimum description length (MDL) principle is addressed. Based on an asymptotic formula of description length, we apply the branch and bound technique to finding true network structures. The resulting algorithm searches considerably saves the computation yet successfully searches the n...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005